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Abstract 

We study a random graph Gn, which combines aspects of geometric random graphs and preferential 
attachment. The resulting random graphs have power- law degree sequences with finite mean and possibly 
infinite variance. In particular, the power-law exponent can be any value larger than 2. 

The vertices of G„ are n sequentially generated vertices chosen at random in the unit sphere in R^. 
A newly added vertex has m edges attached to it and the endpoints of these edges are connected to 
old vertices or to the added vertex itself. The vertices are chosen with probability proportional to their 
current degree plus some initial attractiveness and multiplied by a function, depending on the geometry. 

Keywords: Complex networks; Random geometric graphs; Scale free graphs; Preferential attachment; 
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1 Introduction 

Preferential attachment models are proposed by Barabasi and Albert [1] as models for large-scale networks 
like the Internet, electrical networks, telephone networks, and even complex biological networks. These 
networks grow in time, because, for example, new routers, transform houses, switchboards or proteins are 
added to the network. The behavior can be modeled by means of a random graph process. A random graph 
process is a stochastic process that describes a random graph evolving with time. At each time step, the 
random graph is updated using a given rule of growth, which will be specified later. 

In literature a number of different rules of growth are explored. For example, each time step we add or 
remove edges/vertices [2], or, more advanced, copy parts of the graph [9]. Furthermore, there is freedom 
in the choice how to connect endpoints of newly added edges. Mostly, one randomly chooses the endpoint 
over the vertices, or proportional to the degree. Another possibility is to assign to each vertex a fitness. In 
[3, 4] additive fitness is explored where one chooses proportional to the degree plus some (random) value. 
In [5, 10] multiplicative fitness is explored where each vertex has a random fitness and one chooses a vertex 
proportional to the degree times the fitness. In this paper we use a constant additive fitness and a variant 
of multiplicative fitness, depending on the distance between vertices. 

Many large networks of interest have power-law degree sequences, by which we mean that the number 
of vertices with degree k falls off as k^'^ for some exponent r > 1. The parameter t is called the power- law 
exponent. Depending on the value of r we classify the following three categories: the infinite mean case, the 
finite mean and infinite variance case, and the finite variance case, which corresponds to r S (1, 2), r €! (2, 3) 
and T > 3, respectively. 

These categories are of interest, because the behavior of the typical distance is determined by the power- 
law exponent r. Results in the literature show that if r € (1,2) the typical distance is bounded by some 
constant, if r G (2,3) the typical distance is concentrated around log log ri and if t > 3 it is concentrated 
around logn, where n is the number of vertices of the graph, sec [13, 14, 15, 16, 10]. 

A large number of graph models have been introduced to describe complex networks, but often the 
underlying geometry is ignored. In general it is difficult to get rigorous results for properties like the degree 
distribution, typical distances or diameter, even if one disregards the geometry. However, in wireless ad-hoc 



*Dclft University of Technology, Electrical Engineering, Mathematics and Computer Science, P.O. Box 5031, 2600 GA Delft, 
The Netherlands. E-mail: H. vandenEskerStudelf t .nl 



networks the geometry is of great importanee, since in these networks nodes are spread over some surface 
and nodes can only communicate with neighbors within a certain range, depending on the geometry. 

In this paper we wih rely on the geometric preferential attachment (GPA) model introduced in [6] and 
extended in [7] by the same authors. The GPA model is a variant of the well known Barabasi- Albert (BA) 
model. In the BA model new vertices are added to the graph one at a time. Each new vertex is connected to 
m of the existing vertices, where we choose to connect to an old vertex proportional to its degree. In the GPA 
model each vertex has a position on a surface, and we choose to connect to an old vertex proportional to its 
degree times a non-constant multiplicative value. This multiplicative value depends on the distance of the 
old vertex and the newly added vertex. For instance, let the multiplicative constant be 1 if the vertices are 
at distance at most r„ , and otherwise zero. The latter attachment rule essentially describes the construction 
of a simplified wireless ad-hoc network. 



1.1 Definition of the model 

In this section we will introduce the Geometric Preferential Attachment model with fitness (GPAF). The 
GPAF model is described by a random graph process {0^1^=01 which we will study for large values of n. 
For < a < n, each vertex of the graph Ga = {Va,Ea) is positioned on the sphere 5" C M'^. The radius of 
the sphere S is taken equal to 1/(2y^), so that, conveniently, Area(S') = 1. The vertices of the graph Go- 
are given by Va = {1, 2, . . . , cr} and Ecr is the set of edges. The position of vertex v G Va in the graph Go- is 
given hy Xy £ S and the degree at time a is given by c?o {v) ■ 

In total we need 4 parameters to describe the GPAF model. The first parameter of the model is m = 
m{n) > 0, which is the number of edges added in every time step. The second parameter is a > 0, which 
is a measure of the bias toward self-loops. The third one is 5 > — m, which is the initial attractiveness of a 
vertex. And, finally, the fourth parameter is a function Fn '■ [0, tt] — > where the value Fn{u) is a indicator 
of the attraction between two vertices at distance u. 

Before we give the model definition, we first will explain the use of the parameter a. Assume that the 
graph Go is given, consisting of the vertices Vo. We construct the graph Go+i by choosing vertex Xo+i 
uniformly at random in S and add it to Go with m directed edges emanating from the vertex x^r+i- Let, for 
cr = 1, 2, . . . , n — 1, 

TaA^) = + ^)Pn{\Xy - (1-1) 

where \xy — m| € [0,7r] is the angular distance from u to itg along a circle with radius l/(\/2) over the 
sphere S. Furthermore, let the endpoints of the m emanating edges be given by the vertices fo+ii ■ ■ ■ i ^o+'i- 
Intuitively, we would like to choose the endpoints at random (with replacement) from V^, such that v Cz 
is chosen with probability 

TP / (.) _ ^ _ idajv) + S)Fni\Xv - Xa+l\) 
ro(Wo+l - Wj - r (X M^) ' 

where Po( ■ ) = P( - |Go,a;o+i)- However, the above given rule of growth is not well-defined. To see this, 
consider the simplified model for wireless ad-hoc networks, i.e., Fn{x) = l{x<r„}. Then, for any a, there is 
a positive probability that there are no vertices within reach of the newly added vertex x^+i and therefore 
To,„(xo-i-i) = 0. 

Introducing self-loops solves this problem and for this the additional paramater a is introduced. We will 
follow the solution given by the authors of [7] for the GPA model: 



Rules of growth for a > 0: 

Initial Rule ((7 = 0): To initialize the process, we start with Gq being the empty graph. 



Growth Rule (at time a + 1): We choose vertex Xa+i uniformly at random in S and add it to Go 
with m directed edges emanating from the vertex cco+i. Let the endpoints of the m emanating edges 
given by the vertices Wo-iii ■ ■ ■ i ''^a+i- We choose the endpoints at random (with replacement) from V^, 
such that V G Va is chosen with probability 

ro (do(w) +'5)F„(|x„ - xo+il) ^^2) 



max{ro,„ (cco+i ) , q;E[To,„ {xcr+i )] /2} ' 
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and 



for i e {1, 2, . . . , m}. 



f (i) -1-1^ 1 -'-g•.^^^,^^g•+17 o^ 



The above given random graph model is well defined, since the denominator is always strictly positive. 
Indeed, the following lemma calculates the value of E[T^_n{xa+i)] which is strictly positive. 

Lemma 1.1 For any fixed point v £ S, 

[ Fn{\v - U\) du = In, (1.4) 

Js 

where 



1 r 

In = t: / i^„(a;)sina;da;. 

^ Jx=0 



lx=0 

As a consequence, if U is a randomly chosen point from S, then 

E[T,^n{U)]=In{2m + S)a. (1.5) 

Proof. First note that /„ does not depend on v due to rotation invariance. Thus, without loss of 
generality, we can assume that v is at the north pole of the sphere. Using spherical coordinates, we find 
du = Tg sin 6* d6' d^?, where tq = 1/(2y^), and |u — u| = 9, so that: 

/ Fn{\v-u\)du= / / rlFn{e) sine dO dtp ^2Trrl / F„ (61) sin d6' = /„ . 

For the second claim we calculate the expected value of Ta^n{U), (1.1), conditional on the graph Go-: 

Wo,n{U) I G„\ = ^(d.(t;) + (5)E[i^„(|a;„ - U\) \ G„\ 

a 

= In Y^{d^{v) +5)= In{2m + d)a, 

v=l 

where we apply (1.4) on E[i^„(|a;i, — U\) \ Gc], since v < a and, therefore, 

E[F„.{\x, - U\) \Ga]= [ Fni\x, - u\)du = In. (1.6) 



Hence, E[T<,,„([/)] = E[E[T,,„((7) | G^]] = /„(2m + S)a. □ 
We use the abbreviations, for u E S, 

Mcr,n(u) = max{ro._„(M), a8/„(T} and Acr,„(w) = -F„(|u - Xo.+i|), (1.7) 

where 

e^Q{S,m) = {2m + 6)/2. (1.8) 
As a consequence, we can rewrite the attachment rules as 



P (i>« -v)- (dAv) + S)A,,,M , - a + 1^ - 1 - ^"'"("^"+1) 



for i e {1, 2, . . . , m}. 



Remark 1.2 In the above description we add directed edges to the graph and therefore we construct a 
directed graph. For questions about the connectedness and diameter of the graph we ignore the direction of 
the edges, but we need the direction of the edges in the proofs of the main results. 
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Remark 1.3 In this paper we will illustrate the theorems using the canonical functions 



F:^\u)^\, Fli\u) = M\u\<r„} and FS\u) = \ (1.10) 

max{ri ^ ,u]P 

where r„ > n"^"^!"^ , e < 1/2, ?/> < 1/2 and (3 <E (0,2) U (2,oo). The canonical function F,f^ implies that the 
vertices are chosen proportional to the degree, and, furthermore, the geometry is ignored, the model is then 
equivalent to the PARID model, see [3] or section %1.2. The function F^'' implies that a new vertex can only 
connect to vertices at distance at most r„ . Finally, canonical function F^'' implies that vertices are chosen 
proportional to the degree, and, in contrast to Fll'\ will prefer vertices close to the new vertex, since F^'' is 
non-increasing as a function in u. 



1.2 Heuristics and main results 

Using the results of [7], which is a special case of our model when (5 = 0, together with the results of the 
PARID model, introduced in [3], we will predict how the power-law exponent of the degree sequence will 
behave. 

Consider the PARID random graph process {G'^}cr>o as introduced in [3] with constant weights equal to 
m. For this special case, we give a brief description of the model. 

The construction of the PARID graph G'„ = iV^, E'^) depends on the graph G'„_i. The rule of growth is 
as follow: add a vertex to the graph G'„_i and from this vertex emanates m edges. The endpoints of these m 
edges are chosen independently (with replacement) from the vertices of G'^^^. The probability that vertex 
V G V"^_i is chosen is proportional to the degree of vertex v plus 5, more specifically: 

P (choose vertex w I Gl) = ^^2M±A. (1.11) 

(2m + o)(T 

li a < 2, 5 > —m and F„ = F™, then the GPAF model coincides with the PARID model where the 
weight of each vertex is set to m. Note that, for the chosen parameters, 

A,.n{x,) = F^dx, - x,+i\) = 1, = i / sin(a;) dx = 1, 

^ Jx=0 



and 



ae/fff < 2ea = (2m + 6)a = ida{v) + S) = n"},{x„+i), 



thus M™„(xo-+i) — r^"Jj(x'cr+i) = (2m + 6)a. Therefore, the equations (1.9) turns into (1-11), since 

P.(.- , = .) = P.(choose vertex .) = KM + ^ ^(!i±A. 

Mi,n(a;ff+i) (2m + d)a 

Furthermore, note that for these parameters there are no self-loops, since 

P.(.,+,-a + lj-l M<«>(..+i)" 

For the PARID model, we know that the power-law exponent is 3 -I- (5/m, thus we expect that the power- 
law exponent in our model is3 + (5/mifa<2 and F„ = F^°'. For a > 2, (5 = and F„ satisfying some mild 
condition, see (1.12), we know from [7] that the power-law exponent is 1 -I- a, which is independent of Fn. 

We will show in this paper that the power-law exponent is given by 1 -f a(l -I- 6/ 2m), which generalizes 
the two mentioned papers [6, 7]. More precisely, let Ni-{a) denote the number of vertices of degree k in G^ 
and let iVj,(cr) be its expectation. We will show that: 

Theorem 1.4 (Behavior of the degree sequence) Suppose that a > 2, 6 > — m = m(n) and in addition 
that for n — > oo, 

/ FnixYsinxdx = O (n'^Q , (1.12) 

Jx=0 

where 6 < 1 is a constant. Then there exists a constant 71 > such that for all k ~ k{n) > m, 

ex /,7i \ l+C({l+i5/2m) 

iVfc(n) =ne^'=(™'"'^) (-j +Oin^-''), (1.13) 
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where (j)k{Tn, a, 5) tends to a constant (jioci'm, a, 6) as k oo. 

Furthermore, for each e > and n sufficiently large, the random variables Nk{n) satisfy the following 
concentration inequality 



Remark 1.5 Note that the power-law exponent in (1-13) does not depend on the choice of the function Fn- 
We will see in the proof, that Fn manifests itself only in the error terms. 

All the given canonical functions in Remark 1.3 do satisfy the condition given by (1.12); it should be 
evident that for F^f the constants In and 9 are given by 7^°* ~ 1 and 0'°* = 0, respectively. Furthermore, 

in [7] it is shown that one can take I^^ ~ -f,?' = C (1) G (0, 2), and ^ ^/ > 2, and, 

hence we can take 0'^' = 0, 0'^' = and 0'^' = 2'ip, respectively. 

Before we consider the connectivity and diameter of G„, we place some additional restrictions on the 
function Fn. These restrictions are necessary to end up with a graph which is with high probability connected. 
Keep in mind the function F^\u) = l{|M|<r„}, then it should be clear that r„ should not decrease too fast, 
otherwise we end up with a disconnected graph. 

Let pn = /c(/x, n) be such that 



for some ^ € (0, 1]- 

We will call Fn smooth (for some value of fi) if 

(51) Fn is monotone non-increasing; 

(52) npf^ > L log 71, for some sufficiently large L; 

(53) pf^Fn{2pn) > c^In, for somc constant C3 which is bounded from below. 

Before stating the theorem, we will give an intuitive meaning of p„. To that end, consider the function 
F^^{u) = l{u<r„} and use the fact that if r„ = O (n"-^/^"*^) for some e > 0, then the limiting graph is 
not connected, see [11]. It should be intuitively clear that in the limit, each newly added vertex x„ should 
connect to at least one other vertex. Thus, there should be at least one vertex within distance r„ of a;„. At 
time n there are n — 1 vertices and the probability that at least one of these vertices is at distance at most 
Tn of vertex a;„, denoted by Pc{n,rn), is at most C{n — l)r^ for some constant C. On the other hand, we 
see that if r„ = O (n"^/^"*^) then pc{n,rn) = O tends zero for large n, and, as a consequence, in the 

limit the graph is not connected. If, as is our assumption, r„ > n*^"^/^, then pc{n,rn) 00. 

Interpret p„ for general Fn as the radius. The condition that Pc{ri, r„) — s- cx) is replaced by npn > Llogn. 
Then, intuitively, condition S2 implies that for general Fn the value Pc{n,pn), does not tend to zero and 
implies that the limiting graph is connected. The conditions SI and S3 are technicalities, combined they 
ensure that the 'area' due to the radius pn is sufficiently large: condition SI states that Fn is monotone 
non-increasing and combined with S3 one can show that the 'area' within radius 2p„ is (2p„)^i^„(2/9„), which 
is at least 4c3/,i. 

Theorem 1.6 If a >2 and Fn is smooth, m > Klogn, and K is sufficiently large constant, then with high 
probability 

• Gn is connected; 

• Gn has diameter O {log n/ Pn). 

Remark 1.7 All the canonical functions are smooth. It should be evident that one can take for F^h /i*"' = 1, 
Pn' = 1 and Cg' 1 for FI°K For F^\u) one can take for example ~ 1/4 and p^' = r„/2 and Cg' ~ 1. 
Finally, F^^ is also smooth, we refer to [7] for the precise values o/pj^', /.i'^' and c'^\ 

We end with a sharper result on the diameter, however we, also, need stronger restrictions on the function 
Fn. We will call Fn tame if there exists strictly positive constants Ci and C2 such that 




(1.14) 




(Tl) Fnix) > Ci for < X < tt; 



(T2) /„ < C2. 
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Theorem 1.8 If a > 2, 6 > —m and Fn is tame and m > Klogn, and K sufficiently large, then with high 
probability 

• G„ is connected; 

• Gn has diameter O (log^ n) . 

Remark 1.9 It should be evident that the function F^'' is tame, since one can take Ci = C2 = 1. // 
(3 € (0, 2) then we also have that ' is tame, since 



Remark 1.10 If we consider the configuration model (CM), see ^1.3, [15, 16] or the Poissonian random 
graph (PRG), see [12, 10], then the typical distance depends on the power-law exponent. If the power-law 
exponent is larger than 3, then the typical distance is of O (logn), where n is the number of vertices in the 
graph, and if the power-law exponent is between 2 and 3 then the typical distance is 0/ O (log log n) . On 
before hand, it is not clear if this holds for the GPAF model. Theorem 1.6, only states an upper bound on 
the diameter, independent of 5. 

If Fn = -Fjf = 1 and 5 e (— m, 0) then the authors of [14] show that the diameter in the graph Gn 
fluctuates around loglog n . If Fn(u) ~ F^^u) = l{\u\<r„}, then, intuitively, the diameter depends only on 
rn, since rn determines the maximal length of an edge, and we conjecture that the diameter is at least of 
order logn. 

1.3 Related work 

In this section we consider random graph models, which are related to the Geometric Preferential Attachment 
model with fitness (GPAF). 

As mentioned earlier the model is related to the Albert-Barabasi (BA) model. In the BA-model the 
power-law exponent r is limited to the value 3, which was proven by BoUobas and Riordan. 

Cooper and Frieze introduced in [2] a very general model preferential attachment model. In this model it 
is both possible to introduce new vertices at each time step or to introduce new edges between old vertices. 
Due to the weights with which edges of the new vertices are attached to old vertices and the adding of edges 
between old vertices, the power-law exponent r can obtain any value r > 2. 

In [4] the authors overcome the restriction r > 3 in a different way, by choosing the endpoint of an edge 
proportional to the in-degree of a vertex plus some initial attraction A > 0. This is identical by choosing 
the endpoint of an edge proportional to the degree of a vertex plus some amount S = A — m > — m, as done 
in the PARID model (cf. [3]). The power-law exponent in [4] is given by t = 3 + (5/m. Note that for (5 = 
we obtain the BA model. The authors of [3] show more rigorously some of the results in [4]. 

Both in [2] and in [3] it is allowed to add a random number of edges W, with the introduction of a new 
vertex. In case the mean of W is finite the power-law exponent is given by r = 3 + (5/E[VF]. Hence, if 
V{W = m) = 1 for some integer m > 1 then we see that t = 2 + 5/m > 2, since we can choose for S any 
value in {—m, 0). 

In [6, 7] the authors add geometry to the BA model, which corresponds to the GPAF model, introduced 
above, with 6 = 0. Due to a technical difficulty the model has an additional parameter, called a > 2. As 
a consequence of this restriction they only obtain power-law exponents greater than 3, since the power-law 
exponent is given by r = a + 2. 

By combining the GPA and PARID model, we obtain the GPAF model, introduced in this paper. Due 
to the additional parameter S, it is in this model possible to obtain any power-law exponent r bigger than 2. 

1.4 Overview of the paper 

The remainder of this paper is divided into three sections. In §2 we will derive a recurrence relation for the 
expected number of vertices of a given degree. In §3 we will present a coupling between the graph process 
and an urn scheme, which will be used in §4 to show that the number of vertices with a given degree is 
concentrated around its mean. 




and 
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2 Recurrence relation for the expected degree sequence 



In this section we will establish a recurrence relation for Nk{a) ~ ¥\Nk{<j)\j the expected number of vertices 
with degree k at time cr, which is claim (1.13) of Theorem 1.4. From this recurrence relation, we will show 
that 

7Vfe(cr) ^ apk, 

where pk ~ /j;-(i+a(i+'5/2ni))^ as fc — > oo. The proof of claim (1.13) depends on a lemma, which is crucial 
for the proof. This lemma states that for sufficiently large n the value Ma,nixcr+i) is equal to a0/„CT, 
with high probability. This is a consequence of the fact that Ta,n{xa+i) is concentrated around its mean 
E[Tcr,n(a;cr+i)] = 20/„CT < aQInO', see (1.5) and (1.8), which is the content of the next lemma. 

Lemma 2.1 If a > 2, S > —to, cr = 0, 1, 2, . . . , n, and U is chosen randomly from S then 
P (|T,,„([/) - E[T,,„([/)]| > e/„ (a^/" + ^1/2 log a) logn) = O {n~^) . 

The proof of this lemma is deferred to §4.1. 

We will allow that to depends on n, thus m = m(n), as already pointed out previously. In establishing 
the recurrence relation for N^^cr), we will rely on the derivation for (5 = in [7, Section 3.1]. 

At each time, we add a new vertex from which to edges are emanating, and for each of these to emanating 
edges we need to choose a vertex-endpoint. The first possibility for a vertex to have degree k at time cr + 1 is 
that the degree at time cr was equal to k and that none of the to endpoints, emanating from x^+i, attaches to 
the vertex. Furthermore, ignoring for the moment the effect of selecting the same vertex twice or more, the 
vertex could also have degree fc — 1 at time cr and having one endpoint attached to it at time cr + 1. Finally, 
it is also possible that the newly added vertex x^+i has degree k. The total number of vertex-endpoints 
with degree k is distributed as Bin (TO,pfe(cr)), where 



{k + 6)Acy,niXy) 



+1) 



(2.1) 



and Dk{a) C is the set of vertices with degree k in the graph Go-. Similarly, the number of vertex- 
endpoints with degree fc — 1 is distributed as Bin (TO,pj;_i(cr)) . If the newly added vertex Xrr+i ends up with 
degree fc, then this vertex has k — m self-loops. The number of self-loops, d^-i-i{(T + 1) — to, is distributed as 
Bin (to, p), where 

p=l- r,,„(a;a+i)/M<,,„(x,+i). (2.2) 

For k > m, this leads to. 



E[Nk{a + l)\G^,x^+i] = Nk{<j) ~ mpk{<j) + mpk^iicr) 

+ E[l{d„ + i(a+l) = fc} I Ga,Xa+l] + O {m.rik{G „ , X „ +l)) ; 



(2.3) 



where rjk{Ga,Xa+i) denotes the probability, conditionally on Ga, that the same vertex-endpoint is chosen 
at least twice and at most fc times. 

Taking expectations on both sides of (2.3), we obtain 



iVfc((T + l) =iVfc((T)-TOE 



E 



(fc + &)Ac,^n{Xv) 
M„^n{Xa+l) 



toE 



E 



veDk[<y) 

(fc - l + 5)A^^n{Xv) 
Ma^n{Xa+l) 



+ P {d^+i{a + 1) - TO = fc - to) + O {mE[qk{Ga.x^+i)]) . 



Let 



= {|T,,„(a-,+i) - 2e/„a| < Cie/„a^logn} , 
where max{2/a,6'} < 7 < 1 and Ci is some sufficiently large constant. If 

a>to = <o(n) = (logn)2/(i-^). 



(2.4) 
(2.5) 
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then Ba- implies that for sufficiently large n, 

T.,„(x„+i) < 2e/„(7 + Cie/„(T^ logn = 2e/„a {l + O (log-i n)) < aQI^a, 

since a > 2, and, hence, with high probability 

Mcr,nixa+i) = max{Tcr,„ (Xa+l ) , Q!9/„Cr} = a6/„cr. 

Next, we consider each term on the right hand side of (2.4) separately, for cr = 1, 2, . . . , n. For the first 
two terms on the right hand side of (2.4) we will use that Pfc(cr) is a probability and that P (S^) = O (n~^), 
for a > to, see Lemma 2.1, which yields 



E[pki<7)] - E[pk{a) I Bn] P (Bn) + E[pk{<j) \ B^^] P (S,^J E[p,(a) | 6„] + O (n^^) 
Also, using that Nkicr) < a, 

E[Nk{<j)]-E[Nk{<j)\B'f, 







E[Nk{cT)\B„ 

For (T sufficiently large, using (1.4), (1.7), (2.1) and (2.7) 

k + S 



(2.6) 



(2.7) 



E[pkia)\B„ 



-E 



aQInCT 

k + 5 



E 



/ -Fn(|a;i, - u\)du 
{k + 5)E[Nk{a)\Bn] {k + 5)Nk{^) 



B„ 



q;8(T 



a8(T 



+ C'(fcn-2) 



Combining (2.6) and (2.8), we obtain 



E[pk{cj)]=E 



E 



{k + (5)^o-,n(a:T,) 



{k + 5)Nk{cT) 



+ O {kn~^) 



(2.8) 



(2.9) 



for a > to ~ to{n) = (logri)^/'-^^''-'. The above statement remains true when we replace k hy k ~ 1. 

For the third term on the right hand of (2.4), one can show that for a > to, using that da-+i{xa+i) — m 
has a binomial distribution, see (2.2), thus 



' (da+iix^+i) - m = k - m \ Bn) 



m 
k — m 



E[/"™(l-p) 



2m~k 



m 
k — m 



Bn 



k — r 



2k-r. 



[1 + O (a-'-Hogn)) , 



where we refer to [7, §3.1] for the derivation of the above result. It follows that 

P (d.+i(x,+i) = fc) = P (d,+i (.T,+i) -m = k-m\ Bn) P {Bn) + 0{P (B'J) 

where we refer to [7] for the derivation of the error term O (cr'*'~^ logn). 
For the fourth and final term on the right hand side of (2.4), we use 



r]kiG„,Xa+i) = O min "(EE 



(z+|J|)^A.,„(x„)^ 

M^,n{Xa+l)^ 



,1 



t=ni v£Di(ij) 

which generalizes Equation (5) in [7]. Using similar arguments that led to (2.9), one can show for 

(7>ti=ii(n)=?i(^+'')/27 and k < ko ^ koin) = n^^-'>^/\ (2.11) 
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that 



E[m7^fc(G.,x,+i)] = O = O (a^-i) . (2.12) 
\ ma J 

Substituting (2.9), (2.10) and (2.12) in (2.4), we end up with the following recurrence relation: 

— — jn Tfi — 

Nk{o + 1) = Nk{a) --^{k + 5)Nk{a)/a + (fc - 1 + <5)iV,_i(a)/a 

+ l{,n<k<2m}( J" \ (l-2a-i)'"'"(2a-i)''~" + 0(a^-ilog(n)), (2.13) 

for k > m and Nm-i{(j) = for all cr > 0. The above recurrence relation depends on a and fc. Consider the 
limiting case, i.e., a — s- oo, and assume that for each fc the limit 

Nkia)/a^Pk (2.14) 

exists. If this is indeed the case, then in the limit the recurrence relation (2.13) yields: 



Pk = —^[k - I + 6)pk-i 7^{k + S)pk 



+ l{m<fe<2m} (l-2a ^) (2a ^) 



where k > m and pm-i = 0. By induction, we then obtain, for k > 2m.^ 



^{k-l + 6) ^ k-1 + 6 ^ T{m + l + S+^)r{k + S) 
^ + ^ik + sf''-'^ k + 5+^^''-'~ T{m + 6)r{k + l + S+^y 



Using that r{t + a)/r{t) ^ t° for a £ [0, 1) and t large, we can rewrite the above equation as follow: 



Pk ^ (f>kim, a, S) (J^^ ^ (l>k{m,a,5) 



where (j)k{m,a,5) = 0(1) and tends to the limit (/)oo(w, a, (5) depending only on m, a and (5 as fc ^ oo. 
Finally, following the proof in [7, from equation (15) up to the end of the proof], which shows that there 
exists a constant M independent from n, such that 

\Nk{a) - pk<j\ < A/(ni-(^-'')/4 + logn), (2.15) 

for all < cr < n and m < k < ko{n). Thus, the assumption (2.14) is satisfied. By picking 71 > sufficiently 
small, we can replace the right hand of (2.15) by n^~^^ , and one obtains the claim (1.13). 



3 Coupling 

In this section we make preparations for the proofs of Lemma 2.1 and the concentration result in Theorem 
1.4, see (1.14). In this section we take r e {1, . . . , n} fixed and we consider the graph process up to time t — 1 
resulting in the graph Gr-i- At time t we apply the Growth Rule, see §1.1, twice on Gr-i, independently 
of each other, which results in the graphs Gr and Gr- The idea is to compare the graphs Gt and Gr over 
time by considering G^ and G^ for t < a < n. To this end, we will introduce two urn processes. The urns 
consist of weighted and numbered balls. Instead of choosing a vertex-endpoint v G V^+i at time a+1 by (1.2) 
and (1.3), we will draw (with replacement) a ball proportional to its weight and then the vertex-endpoint is 
given by the number on the ball. 

The coupling between the urns will be introduced in four steps. The first step is to introduce for any 
cr > T two urns. Secondly, we will introduce a probabilistic coupling between the two urn processes. Thirdly, 
we will describe the coupling between the graph processes Ga, Ga and the two urn processes. Finally, we 
consider the vertex-endpoints w^*' and ?)^'', for i = 1,2, ... ,m, in the graphs Ga and Ga, respectively, and 
we will calculate the probability that v'-^'' ^ v^*' . 
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3.1 The two urns 



In this section we describe the contents of the urns corresponding to the graphs G„ and G„, for a = 
r, r + 1, . . . , and we give an alternative way of choosing the vcrtex-endpoints using the urns. 

Fix two graph processes {Gs} and {Gs} such that the graphs up to time r — 1 are identical, i.e., Gs = G,, 
for s ~ 0, 1, 2 . . . , T — 1, and that Xg = Xs, for s ~ t + 1, t + 2, . . . , n. Thus, the points Xr and x-r will 
differ from each other, and, as a consequence, also, the edge sets Eg and Eg, for s ~ t,t + I, . . . ,a, will be 
different. Finally, we assume, without loss of generality, that Ta^n{xcr+i) < Tc^n{xcr+i)- 

Next, we will describe the contents of the urns Ua and given the graphs G^ and Gg-, and the newly 
added vertex Xa+i- We will use the following abbreviations: 

T^,n ^T^,n{xcr+i) and M„^„ = M^^n{Xa+l)- (3.1) 

Furthermore, if e is an edge, then we denote by TO(e) the endpoint of the edge. Thus, if edge e is added at 
time t, emanating from the vertex t, points to a vertex s e Vf then TO(e) ~ s. 

Contents of the urns: 

• For each edge e £ E^, such that TO(e) ^ t, there is a white ball in Ua of weight Aa-.n{xTo{e)) and 
numbered TO(e). Similarly, for each edge in e S E^, such that TO(e) ^ r, there is a white ball in C/o- of 
weight Aa-.n{xTo{e)) = ^(t,ti (a^To(e) ) and numbered TO(e). Observe that iTo(e) = 2^x0(6) since TO(e) 7^ r. 

• For each vertex v G Va\{T} there is a red ball in each of the urns 1/^ and [/o- of weight (m + S)Aa-.n{xv) 
and numbered v. 

• For the vertex r there is in Ua- a purple ball of weight (^^(t) + S)A^^„{xt) and number r, and in 
there is an orange ball of weight [da-ir) + S)A^_„{xt) and numbered r. 

• For the vertex a + I each of the urns and contain a green ball of weight {aQInC — Ta-^n)'^-, 
where (•)+ = max{0, •}, and numbered a + I. Furthermore, we add only to Ua a blue ball of weight 
{{aQInO- - Ta.n)^ - {aeind - fa. and numbered cr + 1. 



Remark 3.1 The total weight of the white and red balls in are given by 
Ac,_„(a;To(e))l{To(e)7^T} and 



^ {m + 5)Aa,n{x^), 

veVa\{r} 



respectively, and the weight of the purple ball in Ua can be rewritten as 

(dair) + 5)Aa,n{xT) = ^ A^-^n (xTo(e) ) l{To(e)=r } + (to + d)Aa,n{Xr)- 



Therefore, the total weight of the white, red and purple balls in Ua is equal to: 

Aa,7i{x^o(e)) + 5)Aa^n{xv) = {da{v) + (5) Ac,,„ (a;„ ) = Ta-, 



vev„ 



Furthermore, from (1.7), and some easy calculation, the total weight of all the balls in Ua is Ma,n- Similarly, 
the total weight of the white, red and orange balls in the urn Ua is Ta^n o,nd the total weight of all the balls 
in Ua is, precisely Ma^n- 

The weight of a ball depends on the time ct, the color of the ball and the number on the ball. Let 6 be a ball 
in Ua or Ua, then we define the weight function Wa as 



Wa{b) 



'^'r,nix^lb)) 
(m + 6)Aa,n{x^{t)) 
{da{Xr) + 5)Aaji{Xr) 
[daiXr) + 5)Aa^n{Xr) 
{aOInCr - fa,n)^ 



if b is white, 
if b is red, 
if b is purple, 
if b is orange, 
if b is green. 



(3.2) 



[ {{aQIna - Ta,n)+ - {aQIn<j - Ta,n)+)+ if b is blue. 
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where S,(b) is the number on the ball. Observe that the number and the color together determine the weight 
of a ball. 

We identify a set B C Ua or B C Ua- oi distinct balls by the set of pairs (c, k), where c denotes the color 
and k the number of the ball. For any set B of distinct balls, define 

\\B\\, = Y,w,ib). 

b£B 

We will draw the balls {^^'iFii with replacement from the urn Ua proportional to the weight. Let 
{^^'Irii be the sequence of balls drawn from Ua-, then it is easy to show that 

P imU) =v\U,)=F {v^;i, ^v\G,, x,+,) (3.3) 

and P = " I = P ^w^+i = V I G^, , for u € Vcr+i- As an example we will show (3.3) for 

V £ V'o-+i\{t, cr + 1}. Observe that in this case the left hand side of (3.3) corresponds to the probability on 
the event that we draw the red ball numbered v or one of the dij{v) — m white balls, thus 

(m + S)A^^nixv) + (dtj(w) - ■m)A^^„{xy) 



^ (mil) = y\u.) 



\\u.\\. 



K+1 =v\Ga,x^+i) 



Ilf/.IU 

by (1.9), since ||C/(j||cr = Alc^.n (see Remark 3.1). 

3.2 The joint distribution of drawing balls 

In this section we describe how we simultaneously draw the balls from the urns Ua- and U^- As before, we will 
assume that Ta^n < ?cr,„, or, equivalently, llJ/o-llo- < ||C^cr||cr, see Remark 3.1. In the last part of this section 
we calculate the probability of the event {fe^' ^ ^^'1, for i = 1, 2, . . . , m, and t < a < n, i.e., the event that 
the two balls 6^'' and 6*^'' in the i^^ draw do not agree on number or color, which we call a mismatch. 
Define the following sets 

Ra^Ua\Ua, Ca^U^nUa and La = Ua\Ua, 

where, as before, we compare the balls by color and number. 

Remark 3.2 By construction, L„ only contains white and orange halls, Ca contains only white, red and 
green balls, and contains only white, purple and blue balls. Furthermore, concerning the weights, we have 
the following relations 

\\Ca\U + \\Ra\U = \\U4a and ||C,||. + ||L,||. = ||C/.||„. (3.4) 

Next, we give the joint distribution of drawing balls from the urns Ua and Ua. 

The joint distribution: Draw, with replacement, m balls 6^^^]^, . . . , from Ua. For convenience we 
write 6'*' = b'^a+i f'^'' * ~ 1, . . . , m. For each i, we define 6*'' = V'^^i by 

• If e Ca then, with probability 

Kk, (3.5) 

we set = 6''', otherwise we choose 6*'' from La, i.e., we choose b € La with probabihty w{b)/\\La\\a', 
observe that the quotient in (3.5) is bounded by 1, because, as remarked earlier, ||C/cr||cr < llt^o-licr- 

• If G Ra, then we choose 6'*' from La, i.e., choose b £ La with probability Wa{b) /\\La\\a- 

The marginal distributions: Denote by ]P( • ) the joint probability measure under the above introduced 
couphng. Furthermore, let Pc( •)=]?(• \Ua, Ua]- We will show that under the coupling 



rib''' =b) = ^^^F{h^'' =h\Ua) (3.6) 
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and 

Wa_{b) 



(b^^)^b^ = ]!!^=p(iw^b\u,y (3.7) 



for b G and b E respectively. The claim (3.6) is true by construction. For the claim (3.7), if 6 G Ua, 
then this implies that b G or 6 G L^, but not in both. Firstly, assume 6 G Co-, then 



P,(6<^' = 6) = P,(6<^' = = 6) P^(6<^' = 6) = 

Secondly, if 6 G io-, then 

P. (6'^' = 6) = P,(6<^' = G a) hib'"' e a) 



fe<^' = b\b''> G P.(5<^> G R„) 



,ic/.iu; iic^-iu ii^c^iu iic^c^iu 

w<r{b) (diaiia + - 11(7,11, nail,) 



Wcr(&) 



l|i.lUI|c/alUI|(7,||, ||c/,||, 

where we used in the last step the relations given by (3.4). Hence, also, the claim (3.7) is true. 



3.3 The joint growth rule between coupled graphs 

Fix T G {1,2,..., n}, as before, and consider the graph process {GsYsZo- Let {GsYsZl be an identical copy 
of {GsYsZqi s-iid choose at time r the position and Xr in Gt and Gt, respectively, at random in 5", 
independently of each other. Using the urns, we will describe the growth of the graphs G, and G, over time. 

At time r we apply the Growth Rule, independently, on the graphs Gi-^i and Gr-i- Then at time 
(7+1, for cr > T, let Xc+i randomly chosen from 5* and set icr+i = Xa+i- Let t/, and ?/, the urns correspond 
to (G(j,x,+i) and (G(j,i,+i), respectively. Note that this is precisely the setting as described in §3.1 and, 
as a consequence, we can use the results of §3.2. Draw with replacement m balls, {b^a+i\l.=i^ from [/,, then 
the vertex-endpoints of x^+i are given {C(&J^+i)}™ i. We, also, draw with replacement m balls, {^J^+i}™ i, 
from [/,, and construct G,+i in the same way. 



3.4 The probability on a mismatch 

The event of a mismatch of vertex-endpoints in the graphs G, and G,, cr > r, can be expressed in terms of 
drawing balls from the urns C/, and [/,, since 

{Ci ^ Ci} = ^ ^{b^^U)] C {^1 ^ • (3.8) 

Thus, we will concentrate on the probability of a mismatch between the drawn balls from the urns. Without 
loss of generality, we assumed that HJ/a-Hcr < llt'crllo- or, equivalently, r,.„ < T,_„. Using the joint distribution 
of the urns, see Section 3.2, and (3.4), we obtain 



||;7,||, ' \\Ua\\a ||J7,||, j|;7,i|,' 

By (1.7) and Remark 3.1, we can bound the denominator on the right hand side of (3.9) from below by 

\\Ua\\a > \\Ua\\a ^ M.^r. > aSIna. 

Next, we consider the numerator on the right hand side of (3.9). The set only contains white balls and 
the orange ball, see Remark 3.2. Therefore, compare (3.2), the total weight of L, can be written as 
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where 



u 



{e : TO(e) ^ r}. 



Thus, the probabiUty on a mismatch between balls is bounded from above by 

aQI„a 



"cr+1 r= J — 



(3.10) 



(3.11) 



Remark 3.3 IfT^ n > T„_n, then it should be clear that one can interchange the roles of Ga and Ga in §5, 
which implies that for this case 



where f o- = U 



"<T+1 T Off+i I — 



{e : TO(e) ^ r}. 



(T,n {"^T ) 



< 



aQIn(T 



(3.12) 



4 Proof of the main results 

In this section we will prove the main results, i.e. Theorem 1.4, 1.6 and 1.8. The diameter results. Theorem 
1.6 and 1.8, can be proved almost immediately using the proofs in [7], but this is not true for Theorem 1.4. 
The proof of Theorem 1.4 relies on Lemma 2.1 and this takes more effort. 

This section is divided into 3 parts: in the first part we will give the proof of Lemma 2.1, then, in the 
second part, we will give the proof of the main results, and in the last part we show a bound on the number 
of expected mismatches, which is necessary for the proof of Lemma 2.1. Before doing so, we will consider 
the number of mismatches between G^ and Go-, for a > t > 1, where a perturbation is made at time t as 
defined in Section 3.3. 

At each of the times s = r, t + 1,...,(7— 1, we sample (with replacement) m balls from each of the urns 
Us and Us- After m draws we end up with the balls {6^''}™]^ and If the two balls 6^*' and 6*,*' in 

the i*"^ draw do not agree on number or color, then, as before, we call the draw a mismatch, i.e., {6^'' ^ ^i,''}. 
Let A J the total number of mismatches between the urns Ua and Ua, then 



S=T 1 = 1 

Furthermore, for u G S* define 

a m 



S — T i — 1 



(4.1) 



(4.2) 



Next, we will relate the expected values of (4.1) and (4.2). Fix any y € S and let U be randomly chosen in 
S, then 



E 



J^„(|x^(,u))-C/|)-F„(|x^(gw,-C/|) 



< 



< E 



(i^„(|x^(,(.,) - U\) + F„(|x^(g,.,) - U\)) 1{6«#S«} 

2 / F„(|2/-u|)dwl{6W#h("} 
. Js 



= 2/„E[l{bW5^&w}] 



where we used (1.4). Thus, 



E[A,(C/)] < 2/„^^E[l{6<-Vb<-'}] = 2/„E[A,] 



(4.3) 



S — T i — 1 



Lemma 4.1 Under the conditions of Theorem 1.4, let a > 1 and U randomly chosen in S , then for some 
constant C > 0, 



and, as a consequence, 
since (Q/m)^^ < 2. 



E[A;(C/)] < Cm/„ 



E[Al{U)] < Cm/„ 



log cr. 



(4.4) 
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The proof of the above lemma is deferred to §4.3. 

Remark 4.2 For the proof of the main result, we need that the number of mismatches is of o(a), which 
implies that the exponent in (4.4) should be smaller than 1, i.e., m/aQ < 1. For a > 2 and 6 > —to this is 
indeed the case: 

m 2m 2 

— < < — < 2, 

Q TO + (m + 6) TO 

thus m/aQ < 1. 

If S = 0, which is precisely the model introduced in [7], then the condition simplifies to l/a < 1, which is 
a weaker condition than the condition used in [7]: 2/a < 1. Nevertheless, we cannot get rid of the condition 
a > 2, because we need that the event occurs with high probability, see (2.5). 



4.1 Proof of Lemma 2.1 

In this section we will prove Lemma 2.1 using the Azuma-Hoeffding inequality, which provides exponential 
bounds for the tails of a special class of martingales: 

Lemma 4.3 Let {Xt-}^>o be a martingale process with the property that, with probability 1, there exists a 
sequence of positive constants {er}T>i such that 

\Xr - Xr-l\ < Cr, 

for all T > 1. Then, for every A > 0, 

V{\X„-Xq\ > A) < 2exp 



For a proof of this lemma, we refer to [8]. 

We will apply Lemma 4.3 by taking a Doob-type martingale Xr = E[Tc.n{U) \ Gt], where U is chosen at 
random in S. By, convention, we let Go be the empty graph, then 

Xo = E[r,.„(C/)] and X, = T^^U)- 

At each time step s we add a new vertex and to edges, see the Growth Rule in Section 1.1, call this an 
action. We call an action A acceptable if the action can be applied with positive probability. Furthermore, 
denote by A{G) the set of all acceptable actions that can be applied on the graph G. 
Clearly, 



\E[T,,n{U)\Gr-l]-E[T„,n{U)\Gr] 



< sup sup 

G^_i AeA{Gr-i) 



E[T,,„([/)|G,_i(A)]-E r,,„([/)|G,_i(i) , (4.5) 



where the first supremum is taken over all possible graphs Gr-i. Next, fix the graph Gr-i and let G-r = 
Gt-i{A) be the graph by applying the action A on the graph Gr-i- Similarly, define Gr = Gt-i{A). Thus, 
one can rewrite the right hand side of (4.5) as 



Ct = sup sup 

Gt_i A£A{G^-i 
AeAiG^-i 



Using the triangle inequality, the above implies, under the coupling. 



Cr < sup sup E 

AG^(G,_i) 



(4.6) 



We claim that, independently of G1--1, A and A, and, for ct > r. 



E 



T,,n{U) - f„,n{U) 1 < GTO/„(a/r)2/", 



(4.7) 
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where the proof of this claim is deferred to the end of this section. Thus, using (4.6) and (4.7), 

T=l T=l 

To show the above, let /3 = A/a, then /3 e (0, 2). If /? e (1, 2), then J2r=i '^"'^ < and, if /3 G (0, 1], then 

From Lemma 4.3 we then obtain for some constant Ci, 

P (|T,,„([/) - E[r,,,([/)] I > CiTO/„(a2/" +ai/2loga)(logn)i/2^ < 2e-2iog". 



By taking n sufficiently large, we can replace Ci(logn)^/^ by logn, which is, precisely, the statement of 
Lemma 2.1, given the claim (4.7). □ 

Proof of claim (4.7): Denote by d~{v) the in-degree of the vertex v at time a, and observe that 

d^{v)^d-{v)+m, (4.8) 

since each vertex has by construction m edges pointing outward to other vertices or itself. Furthermore, we 
denote by y*'' the position of the i"* vertex-endpoint at time s, thus 

y^''=a;,(.). (4.9) 

By construction of Gr and G,-, we can apply the coupling introduced in §3.3. Rewrite Tcr,„(C/), see (1.1), 
using (4.8), (4.9), and the coupling, as 



Ta,n{U) = d4v)Fn{\Xv - U\) + (5^ Kd^. - U\) 

v—1 v—1 
(7 a 

= d-{v)Fn{\x, - U\) + (m + 5)J2 Fn{K - U\) 

V—1 V—1 

am a 

= T.T. P-^\ys' - U\) + {m + S)Y^ F„(|x„ - U\) 



s—1 i—1 v—1 
am a 

= E E ') - u\) + ("^ + -5) E ^"(1^'' - ^D- 

s—1 i—1 v—1 

Up to and including time r — 1 both graphs arc identical, thus the absolute difference 



equals to 

a m 



+ im + S) {Fr,{\Xr - U\) ~ F„i\Xr ~ U\)) 



Using the triangle inequality and (4.2), we obtain 

T.AU) - f^AU)\ < A.(C/) + im + S) [Fnilxr - U\) + F„(|i, - U\)) . 
Taking expectations on both sides of the above display, and using (1.6) and (4.3), yields 



E 



\T^.n{U) - f„AU)\ < 2/„(IE[A,] + {m + S)). 



(4.10) 



Applying Lemma 4.1 on (4.10) finally results in 



E 



for some constant C. This is precisely the claim (4.7). 



□ 
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4.2 Proof of the main results 



In this section we show the main resuhs. The proof of Theorem 1.4 is almost similar to the proof of Lemma 
2.1. The diameter results, i.e, Theorem 1.6 and 1.8 will be proved by using the proofs in [7]. 

Proof of Theorem 1.4: The first part of Theorem 1.4, i.e., claim (1.13), has been proved in Section 2. 
For the second part, i.e., claim (1.14), we now give a proof, which is similar to the proof of Lemma 2.1. 
Therefore, we follow the proof of the previous Section 4.1, where we now choose Xt = K[Nk{n) \ Gr] instead 
of Xr = E[r^,„(C/) I Gr]- Similar to (4.5), we have that 



\E[Nk{n)\Gr-i]-E[Nk{n)\Gr] 



< sup sup 



E[Nk{n)\Gr-i{A)] -E Nk{n)\Gr-i{A) 



Using the coupling, we can bound the right hand side in the above display by twice the number of mismatches, 
since each mismatch can influence at most two edges. Thus, 

\E[Nk{n)\Gr-i] - E[Nk{n)\Gr]\ < 2E[Al] . 

Therefore, we can take Cr = 2]E[AJ^] and we, again, can apply Lemma 4.3, as done in the previous section, 
which proves claim (1.14) and hence Theorem 1.4. □ 

Proof of Theorem 1.6: The proof is almost identical to the proof of [7, Theorem 2]. To apply this proof 
for general S > — m, we only need to replace the constant C3 in [7] by C3, where C3 = C3/2 and C3 is the 
constant of condition S3, see Section 1.2. This will be explained in more detail now. 

Pick fi and pn — PnifJ-, F„) such that F„ is smooth for p, see conditions SI, S2 and S3, see Section 
1.2. Fix u G S* and denote by Ap^ the spherical cap with center u and radius p„, then there exists positive 
constants ci and C2, independent of such that 

V = / dwe [cipl,C2pl], (4.11) 

{w^S : \w — u\<.pn} 

which is shown in [7]. Furthermore, in [7] the authors consider the graph at certain time steps ts, where s 
is a positive integer, such that the area of the spherical cap is given by 

s/2 < Apjs < 3s/2. (4.12) 

In the proof of [7, Theorem 2], the essential step is the statement that the probability that Vt^ chooses vertex 
V & Vt^, assuming that \xt^ — s^ul < 2p„, is at least i.e., 

P(^;<;>=«|G,,_l)>^. 

as 

In our model this is still true, when we replace C3 by Cg = C3/2, since, using the assumptions SI, S2 and 
S3, (4.11) and (4.12), 

p (^m ^ „ I ^ w {dtAv)+S)Fr,{\xt^~x.,\) ^ im + S)Fni2p„) 

^ 2{m + 5)Ap^Fn{2pn) ^ 2(m + J)cip^F„(2p„) 
~ aQInS ~ aQInS 

^ 2(m + (5)ciC3 {m + 5)2cic^ 1 2ciC3 2cid^ 
~ aQs as 2 as as ' 

where we used that (m + (5)/e = 1 + 6/ {2m + 6) >l/2 for -m <S <0 and {m + S)/<d > I > 1/2 for S > 0. 
If we replace the constant C3 by in the proof of Theorem 2, then the proof of [7] holds without further 
modifications. □ 

Proof of Theorem 1.8: For S = the proof is given by the proof of Theorem 3 in [7]. The constant 
A = C1/C2 in the proof of [7, Theorem 3] should be replaced by A = (Ci + 5)/2G2, then the proof holds 
verbatim. □ 
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4.3 Bounding the expected number of mismatches 

In this section we will prove Lemma 4.1. In the proof of the lemma, we rely on two claims, which will be 
stated now. The first claim bounds for any vertex and all time steps the expected degree: 



IE[d^,(cr) +S] < mC 



where C is some constant and 



a = m/aQ. 

The second claim is a technical one, which bounds the expectation of 
from above. More precisely, for any cr > r, 

HQa] < In (e[A,] + EK(t) +S]+ E[d^{T) + 



29 



(4.13) 
(4.14) 

(4.15) 
(4.16) 



Next, we will assume that the claims (4.13) and (4.16) do hold and we will show that Lemma 4.1 follows 
from these two claims. After the proof of Lemma 4.1, we will prove both claims separately. 

Proof of Lemma 4.1: Let t < a < t, then the number of mismatches is recursively defined as 

m 

and; therefore, 

(4.17) 



since we draw the balls with replacement. 
Combining (3.11) and (3.12), yields 



Observe from (1.4) that 



< 



(4.18) 



E[da{T)A^^n{Xr) \ ^^(t),^^] = rf^ (t)]E[F( - a^^+l) | da{T),Xr] 

= d„{T) / F{\Xr - U\)du ^ da{T)In, 



and, hence 



Similarly, E 



E[(d,(T) + 5)A^,n {:xr)] - E[d,(r) + 5] /„. 
{da{T) + 5)Aa.n{xT) = E dcr (t) + (5 /„ . Thus, taking expectations on both sides of (4.18), and 



using (4.16), results in 



E[0,] + (E[d,(r) + <5] /„) + (E d,(r) + (5 /„) 



< 



aQInCr 



(4.19) 



Substituting (4.13) and (4.16) in (4.19), yields 



< 



E[A^] + (2e + 4mC(f7/r)°) 



for some sufficiently large constant C > 0. Therefore, we can bound the right hand side of equation (4.17) 

by, 



E[A,+i] < E[A,] 1 + - 
a 



a\ 2e + 4TOC(cr/T)" 



= E[A,] ( 1 + - ) + 
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where we used that a — m/aQ (4.14). l/a < 1/2 and m/Q < 2. Finally, by taking the constant C larger, 
we can replace the above inequality by 

E[A<,+i] < E[A,] f 1 + 



We will now prove an upper bound for E[Acr+i]. To this end, we consider the solution of the recurrence 
relation q{a + 1) = 9(o')(l + a/a) + b{a), for a > t, with initial condition (/(t) = c. The solution is given by 

r(a + a) ^^ 6(g)r(g + l) Tja + a)T{r) 
'^^^^ T{a) £^r(s + l + a)+'r(^)r(r + a)- 

The above solution implies, that if one takes 6(s) = Ct-^s"-^ < Ct-''^^0^, then 

^('^) ^r(s + l + a) r(cr) ^r(s + l + a) 

The summation can be bounded from above by logcr, therefore, for some constant C, 

E[A,] < c(^)" loga. 
This proves (4.4) and hence Lemma 4.1, given the claims (4.13) and (4.16). 
Proof of (4.13): Note that, sec (1.7) and (1.9), 

K[da+i{v) + S] = {da{v) + S)+ K[da+i{v) - d„{v)] 



□ 



(daiv) + S)[l + m 



< {d^{v) + S) [1 + m 



Therefore, by taking expectations on both sides in the above display, and using (1.6), the value of E[do-+i('t') + ^] 
is bound from above by 



E 



id^iv) + S) 1 + m 



EK,„(a:,)|G<,] 



aQIn 



1 



)n{d4a)+S)]. 



Thus, by induction, and using a = m/aQ, 

' a + a 



E[d,+i{v) + S] = 



E[(d,,(a) + S)] = p!''tnr/^?1 ^[^^"(^) + '^)] 
r{a + l)T(v + a) 



Finally, note that dy{v) < 2m and that (5 is a constant, which implies the claim (4.13). 
Proof of (4.16): Observe that for i = 1, . . . ,to and s > r, see (4.9), 



□ 



(4.20) 



since, Xi = Xi if i t and Xt ^ Xr- The above statement is stronger than (3.8). Using the definition of 
see (3.10), and (4.20), we have that 

a m 

he£,T s=T i=l 

and we can bound J2hei ^a-,nixh) in a similar way. Hence, we can bound (4.15) from above by 

cr m 

< ^ ^ Vy<'' } ( A,,n (yf ) 1{T.,„ ) <f.,4 + A,,„ (y « ) 1{T„,„ >^^^ 

S—T i—1 

S—T i—1 

<7 771 

+ l{T.,„<f„,„} J2 E l|yi''^S<n (A.,„(y<") - A,,„iy^;^)) . (4.21) 
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Next, we will show that the rightmost double sum of (4.21) can be bounded by 

(m + (5)(A<j,„(Xi-) - A^^niXr))- 

For this, we rewrite T^. ^ as 

(7 (T a 

Ta^n = '^{ds{(j) + 5)Aa,n{Xs) = ^ d'^ {(t) Aa ,n{x s) + (m + (5) ^ Aa^n{Xs). 



s=l 



s=l 



Note that 



a { a 7n 



s— 1 s— 1 L t—s i—1 ) 

a m ( t \ (7 m 

= EE E ^{^':'-^]A.A^s) = A.,„(yr')- 



t=l i=l k s=l 



Therefore, 



Ta,n = E E A'^Avf) + ("i + '5) E A-yA^s) 



S — l i — 1 s — 1 

and a similar result hold for Ta ^. The difference of these two expressions equals: 



or 



Ta^n - f^^n = ^1 E^^'^^"^^^") ~ ^'^Af:^)) + {m + J) (A<,,„ (x^) - A^^Xr)), 

s=l i=l 

(7 m 



s=l i=l 



Hence, 



l{T.,„<f„,„} ^ ^(A^,„(y^") + A^,„(y<")) < (to + (5)(A<,,„(xr) - A<,,„(x^)). 

s=l i=l 

Substituting (4.22) in (4.21) yields, 

<7 771 

Qa <Y^Y^{y'^Hy'i^A„Ayf) + (m + <5)(A 

s— r 1 

Taking the conditional expectation with respect to the graphs and Ga results in 

a 7n 

IE[g. I G,] = ^ ^ l{yi'Vsi"}E[^a,n(yr ) I G,, 

s— r i—1 

+ {m + 5) {t A^^xr) I G^, G^J + E A^^Xr) \G„,Ga 
For any fixed value x (z S and using (1.4), we have that 

e[a„Ax) I G,, G,] = t[Fi\x - x,+i|) I G,, G, 
which, in turns, yields that (4.23) can be rewritten as 



E 



Qa I Ga, Ga 



= ^» E E Hyi^'^yi''} + 2(m + <5) . 

\s=T i=l J 

Note that (4.20) implies 

(7 7n G TYi a 7n 

^ ^ l{!/i-'#S<-'} = E E + E E l{''i"'=«i"=-} < A. + da{T). 



(4.22) 



(4.23) 
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Thus, 



E 



Qa I G<j, Go 



,(a, + d,(r) + 2(m + ,5)) = /„ (a, + (rf„(r) + 6) + (2m + 5)) . 



For convenience, we will use the following weaker statement: 



E 



Qa I G„ < /„(A, + (d,(T) + ^) + (d,(T) + 5) + 26), 



where we replaced (2m + 6) by 20, see (1.8). Finally, by taking the expectation on both sides in the above 
display, we obtain the claim (4.16). □ 
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